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^ 1 Introduction 

> 
O 

A frame on an n-dimensional difFerentiable manifold Q is a local basis of vector fields 
£^ {^j}) i = l,2,...,n. A frame consisting of coordinate vector fields d/dq^ is called, 

naturally enough, a coordinate frame, or more obscurely a holonomic one; a frame which 

is not holonomic, or not known to be so, is said to be anholonomic. The term dates 
^\ back at least as far as Schouten's Ricci- Calculus [32 . Why one says anholonomic for 

frames but nonholonomic for constraints in dynamics, with closely related meanings, is 

>- anybody's guess. 

• ^ 

^ In order to test whether or not a frame {Xi] is a coordinate frame (that is, whether 

coordinates may be found such that Xi = d/dq^) one computes the brackets [Xj,A'j]. 
These vanish for a coordinate frame; for an anholonomic frame we may write 

for some locally defined functions R^j on Q. These functions are collectively called by 
Schouten the object of anholonomity, because of their role in determining whether or not 
the frame is truly anholonomic. 

Anholonomic frames can be very useful in certain situations in dynamics because they 
can be adapted to geometrical requirements in a way that may be impossible with a 
coordinate frame. We have exploited this possibility in relation to systems with symme- 
try in previous publications [HI HSJ [131 El EH]. In this paper, we promote the use of 
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anholonomic frames in the study of dynamical systems subject to nonholonomic linear 
constraints. 

Our paper is concerned entirely with dynamical systems of finitely many degrees of 
freedom which are capable, broadly speaking, of a Lagrangian formulation; and it studies 
the systems of interest by uncompromisingly differential-geometric means. One instance 
of this is that, for us, the dynamics is always represented by a vector field; the possible 
motions of a dynamical system are given by the integral curves of the dynamical vector 
field, and the dynamical differential equations are those that determine the integral 
curves of the dynamical vector field. When one is dealing with a Lagrangian system 
(without constraints, for simplicity) one has therefore to interpret the Euler-Lagrange 
equations as implicitly defining a vector field: this will in fact be a vector field of special 
type — a so-called second-order differential equation field — on the tangent bundle TQ 
of the configuration space Q. Special techniques have been developed for the study of 
the differential geometry of tangent bundles and second-order differential equation fields, 
which have been shown to be useful in many applications, such as the inverse problem of 
Lagrangian mechanics and the study of qualitative features of systems of second-order 
differential equations. 

Our insistence on always thinking of the dynamics as a vector field of second-order 
type on TQ is one distinctive feature of our approach. A second is our whole-hearted 
use of anholonomic frames. For example, we reformulate the Euler-Lagrange equations 
(which of course are normally expressed in terms of coordinates) in a frame-dependent 
but entirely coordinate-free manner. To do so we first show how to lift an anholonomic 
frame on Q to one on TQ. In keeping with our differential-geometric approach we do not 
base our discussion of Euler-Lagrange equations on a variational principle, but instead 
derive the equations ultimately by the method of the Cartan form (see [15], for example, 
for a general description, and Section 2.3 for one adapted to our needs in this paper). 

The concept of quasi-velocities has a very natural place in our theory — indeed, we would 
claim that the only way to understand quasi-velocities properly is to view them from the 
perspective of the theory of anholonomic frames. The thoroughly misleading — indeed, 
incoherent — concept of quasi-coordinates to be found in some classical textbooks on 
mechanics is thereby avoided. It is argued in a recent paper by Bloch et al. [5^ that 
although the benefits of the use of quasi-velocities have always been beyond any doubt, 
the mathematical foundations of the theory in textbooks such as [20] and [30j have not 
always been built up as rigourously as they should. We agree; and in fact we take a more 
radical approach to the problem than do the authors of [5]. For example, the nearest 
these authors come to a frame-based version of the Euler-Lagrange equations is Hamel's 
equations. We on the other hand view Hamel's equations, however useful they may be in 
practical applications, as just a half-way house — partly frame- and partly coordinate- 
based. We show in our paper how to derive Hamel's equations, but we make no use of 
them. 

The differential-geometric machinery necessary for our approach is described in Sec- 
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tion 2.1 below. Our version of the Euler-Lagrange equations is obtained in Proposition 1 
in Section 2.2. In Section 2.3 we show how to obtain these equations by means of the 
Cart an forms. 

There has been a long discussion in the literature about two distinct approaches to 
dealing with dynamical systems subject to nonholonomic constraints, which are called 
respectively nonholonomic and vakonomic mechanics. According to [21 |8l |3T] the discus- 
sion dates back to the end of the nineteenth century, with Korteweg, Holder, Voronec 
and Suslov among its more famous participants. Nonholonomic mechanics is the clas- 
sical method for deriving equations of motion for systems with constraints and their 
associated reaction forces from the Lagrange-d'Alembert principle, while the equations 
for vakonomic systems (sometimes also termed variational nonholonomic systems) follow 
from a variational principle where one looks for extremals in the class of curves which 
satisfy the constraints. Experiments such as the one described in suggest that the 
nonholonomic equations of motion are the correct ones to use for mechanical systems with 
nonholonomic constraints. The vakonomic theory, on the other hand, finds applications 
in such fields as economics and LC-circuits (see for example |10)). 

In Section 3 we show in Proposition 2 how our methods may be used to formulate a 
version of the Lagrange-d'Alembert principle and derive the nonholonomic dynamics 
for a system subject to nonintegrable linear constraints, provided it is regular in an 
appropriate sense. 

In Section 4 we turn our attention to vakonomic systems. We derive the equations 
satisfied by the dynamical vector fields of such systems. In Propositions 3 and 4 we 
give a version of these equations, when the multipliers are required to lie in the image 
of a certain section. Section 5 concentrates in much detail on the question of when the 
nonholonomic problem is consistent with the vakonomic one. This question has of course 
been investigated by many authors in the past (see for example [3 [TOl [ISl HHJ EH EH [3l] 
— this list is by no means exhaustive). By carefully analyzing the dynamical vector 
fields involved we show that in fact two notions of consistency need to be distinguished. 
The corresponding consistency conditions can be found in Theorem 2 and Corollary 1, 
repectively. Inspired by a recent paper of Fernandez and Bloch [19] we examine the 
existence of a so-called 'variational Lagrangian' and we clarify its role with regard to the 
consistency conditions (Theorem 1 and Corollary 2). We considerably generalize some 
of the results of [19^ , stated there for Abelian Chaplygin systems only, by showing that 
they apply also to non- Abelian Chaplygin systems (in the sense of e.g. [23]) and even in 
some cases to nonholonomic systems with linear constraints in general (see Propositions 5 
and 6). Our discussion of Chaplygin systems is to be found in Section 6. As in earlier 
sections, the ease with which we obtain the results relies on the choice of an appropriate 
frame. In the case of a Chaplygin system the frame is adapted to the situation of a 
system with symmetry. In the literature one can, of course, also find other approaches to 
(nonholonomic and vakonomic) systems with symmetry and their reduction, e.g. using 
the theory of Lie algebroids [Q] EH |29]; but the benefits of our methods are of course not 
available there. 
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The paper ends with some illustrative examples, namely the class of nonholonomic sys- 
tems discussed recently in and the two- wheeled carriage. We give new and illumi- 
nating derivations of some known results. We also correct the errors which have crept 
into some of the accounts of the two-wheeled carriage in the literature. 

2 Anholonomic frames, quasi-velocities and the Euler- 
Lagrange equations 

In this section we describe the basic constructions associated with anholonomic frames, 
leading to an appropriate formulation of the Euler-Lagrange equations, in preparation 
for the discussion of nonholonomic systems which commences in the following section. 

2.1 Some aspects of tangent bundle geometry 

We shall be concerned with second-order dynamics, that is to say, with dynamical systems 
represented as vector fields of second-order type on velocity phase space, which is the 
tangent bundle TQ of configuration space Q. We denote the space of sections oiTQ — > Q, 
that is the space of vector fields on Q, by ^'(Q); it is a C°°((5)-module, where C°°[Q) 
denotes the ring of smooth real- valued functions on Q. (Actually all considerations in 
this paper are local rather than global, but we shall not continually draw attention 
to this fact.) Though in principle we prefer to manage without coordinates, we shall 
want to use them sometimes, especially in this section: we point out that we denote the 
generalized velocities, in other words the fibre coordinates on TQ naturally associated 
with coordinates on Q, by u*, reserving for the actual derivative of along a curve. 
Our policy in this section is to give informal but intrinsic definitions of objects and 
constructions so that it will be clear that they are well-defined, but to supplement these 
with coordinate expressions for security: this is why we make more use of coordinates 
here than later. 

We shall work with anholonomic frames on Q. Our first task is to show how to lift 
an anholonomic frame from Q to TQ, to give an anholonomic frame there. Since the 
dimension of TQ is 2n we need to double the number of vector fields in the frame. 
Fortunately there are two canonical ways of lifting a vector field from Q to TQ, the 
so-called complete and vertical lifts; we apply them both to each member of the frame 
on Q. These constructions are described in detail in several texts, including [15]; we give 
a brief account below for the convenience of the reader. 

Let Z be any (locally defined) vector field on Q, with flow (pt. By taking the tangent 
map to (ft for each t we obtain a flow on TQ\ the corresponding vector field on TQ is 
called the complete lift (sometimes the tangent lift) of Z and is denoted by Z^. In terms 
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of coordinates 

The second canonical way of lifting the vector field Z from Q to TQ yields its vertical 
lift Z^: this is tangent to the fibres of the projection r : TQ Q, and on the fibre over 
q e Q it coincides with the constant vector field Zq. In coordinates 

We have t*Z° = Z, which is to say that Z^ is projectable and projects onto Z, while 
T^Z^ = since Z^ is vertical. The brackets of complete and vertical lifts of vector fields 
Y and Z are 

[y^, z«] = [Y, zf, z^] = [Y, zy, [y^, z^] = o. 

Note that although the map Z i— >■ Z^ is C°°(Q)-linear, the map Z t-^ Z'^ is only M-linear: 
in fact for / G C°°(Q) 

{fzr = fz''+fz^ 

where / is the so-called total derivative of /, a function on TQ defined by f{q, u) = u{f) 
and given in coordinates by 

■' dq' 

This fact will have an important role to play shortly. (The first term on the right of the 

equation above for {fZ)'~' should strictly speaking be {t* f)Z'^, but we shall not bother 
to distinguish notationally between a function on Q and its pull-back to TQ.) 

Given an anholonomic frame {Xj} on Q we can construct from it the anholonomic frame 
{X^,X^} on TQ. This lifted frame will play a central role in our account of dynamics 
on TQ. 

With any local frame {Xi} there is associated its dual coframe {0^}, consisting of locally 
defined 1-forms such that 9^{Xj) = 5j. Now a 1-form 6 on Q defines a function say vq on 
TQ, linear on the fibres, by vg{q, u) = 9q{u); if 6 = Oidq^ then vq = OiU^. The n functions 
vg. associated in this way with a frame via the dual coframe are called the quasi-velocities 
of the frame; we denote them by for simplicity. Another way of defining the quasi- 
velocities is to say that v^{q, u) is just the zth component of the vector u G TqQ when it 
is expressed in terms of the frame at q. 

We shall need expressions for the derivatives of the quasi-velocities by complete and 
vertical lifts, and in particular their derivatives along the members of the lifted frame 
{Xf,Xy}. To find them, the following two formulas are indispensable: 
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Prom the second of these, Z^{v^) = Z*, where (for any vector field) = 9^{Z) is the ith 
component of Z with respect to the frame. From the first, Z'^(v^) = —[Z,XjYv^. Thus 
we can express X^{v^) in terms of the object of anholonomity: if = RfjXk then 

On the other hand 

x7{v^) = si, 

from the second formula displayed above. 



2.2 The Euler-Lagrange field 

We now turn our attention to dynamics. Wc deal with second-order systems, which means 
that we expect to be able to write the dynamical equations as second-order ordinary 
differential equations of the form = F^{q,q). As we explained in the introduction, 
we follow the time-honoured principle of differential geometry that a suitable system 
of ordinary differential equations should be replaced by the vector field whose integral 
curves are its solutions. A system of equations of the form = F^(q,q) corresponds 
in this way to a vector field T on TQ, but one of a special character: we must have 
'''*{q,u)^ = which is to say that T must take the form 

in coordinates, for then the equations for its integral curves are 

as required. Such a vector field is, naturally enough, called a second-order differential 
equation field. 

One useful property of second-order differential equation fields, which is most easily 
checked by a coordinate calculation, is that for any vector field Z onQ, [Z'^,T] is vertical. 

Suppose wc have an anholonomic frame {Xi} on Q; we can then express any vector 
field on TQ in terms of the lifted frame {X^ , X^}. Since r^Xf = Xi, the condition for 
a vector field to be a second-order differential equation field, when expressed in these 
terms, is that it should take the form 

v'x^ + rx^ 

for some functions on TQ, where the coefficients are the quasi- velocities of the 
frame. 

Note that for any function / on Q and any second-order differential equation field F 
we have F(/) = /. Recall that the total derivative / also appears in the expression 
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for (fZ)'^", which suggests the possibihty of constructive cancellation. Let T be any 
second-order differential equation field and L any function on TQ, and consider the map 
e : X{Q) C°°{TQ) given by 



e{Z) = r(Z^(L)) - 



Then e is evidently M-linear; it is in fact C°°((5)-linear, as the following calculation shows: 



eifZ) = T{fZ^{L)) - {fZf{L) = fZ^{L) + fT{Z^{L)) - fZ-{L) - fZ^{L) = fs{Z). 



Thus e behaves like a 1-form, except that it takes its values in C°°{TQ) rather than 
C°°(Q): it is in fact a 1-form along the tangent bundle projection r. 

A regular Lagrangian L determines a second-order dynamical system via its Euler- 
Lagrange equations. These are usually written as 



When L is regular, which means that its Hessian with respect to the velocities is non- 
singular, these equations can be solved for the q'', or in other words written in the form 
= F'^{q,q). They correspond to a second-order differential equation field T, therefore. 
In fact there is a unique second-order differential equation field T such that 



and its integral curves are the solutions of the Euler-Lagrange equations. But in view of 
the remarks above about e, we see that T must in fact satisfy 



for any vector field Z on Q. Moreover, to determine F, assuming that it is a second-order 
differential equation field and that L is regular, it is enough to require that 



for the vector fields Xi of any frame on Q, even an anholonomic one. To be explicit, we 
may take T = v'^Xf + V^xy , when the equation above becomes 



Now Xy{Xy{L)) are just the components of the Hessian of L expressed in terms of 
the anholonomic frame (recall that vertical lifts commute): so these equations uniquely 
determine the coefficients T\ as required. We sum this discussion up in the form of 
a proposition, which we regard as the fundamental statement of regular Lagrangian 
dynamics. 




fdL \ _ dL_ 
\ du^ J dq^ 



r(Z^(L)) - Z'^(L) = 



nXj{L)) - Xf 



(L) = 0, i = l,2,...n. 



XJ{X]{L))T^ = Xf{L) - v^XfiX-iL)). 
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Proposition 1. Let L be a regular Lagrangian on TQ. There is a unique second- order 
differential equation field F such that 

r(Z^(L)) - = 

for all vector fields Z on Q. Moreover, T m,ay he determined from the equations 

r(X7(L))-Xf(L) = 0, i = l,2,...n, 

for any frame {Xj} on Q (which may he a coordinate frame or may he anholonomic) . 

The integral curves of T are the solutions of the conventional Eulcr-Lagrange equations 
of L, and we may therefore regard the first displayed equation in the proposition as the 
most general form of the Euler-Lagrange equations, and the second displayed equations 
as the Euler-Lagrange equations relative to a frame. 

Note that we do not assume that the Lagrangian is of mechanical type: the remarks 
above apply to any Lagrangian, provided it is regular. 

These general forms of the Euler-Lagrange equations are very useful in the discussion of 
theoretical questions, but in any specific particular case it will eventually be necessary 
to introduce coordinates. One way of doing so, which partially takes cognizance of an 
anholonomic frame, is to use some arbitrary coordinates on Q, but the quasi- velocities 
■u' of the frame for fibre coordinates. We next write the Euler-Lagrange equations for the 
frame in terms of such coordinates. To do so we need to express Xf and XJ in terms of 
the coordinate vector fields, and this is most easily done by simply evaluating them on 
the coordinates. We have 

Xf{q^) = XM^)=Xl X^{v^) = -Iii,v\ XV(g^) = 0, X-(v^)=6i, 
where Xi = X^d/dq^. It follows that 

The Euler-Lagrange equations become 

which are Hamel's equations in our notation. 



2.3 The method of the Cartan form 

The discussion above is limited to regular Lagrangians. Later in the paper we shall 
deal with a variational problem (the vakonomic problem) for which the Lagrangian is 
very definitely not regular, and so the methods developed so far will not apply. To 
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cope with this situation we shall use the method of the Cartan forms, which are in fact 
globally defined forms. However, once again we need a formulation in terms of a (local) 
anholonomic frame, which makes the following account unusual in some respects. 

We consider any Lagrangian L on any TQ equipped with the frame lifted from an an- 
holonomic frame {Xi} on Q whose dual is {^'} and whose quasi-velocities are u*. The 
Cartan 1-form of L is XJ^Vjd'^, the Cartan 2-form 

uj = d{xj{L)e^). 

The energy of L is £^ = v'^XJ^L) — L, and any vector field T satisfying 

Fj a; = -dE 

is an Euler-Lagrange field of L. A word of caution is required: once again we have not 
distinguished notationally between an object on Q and its pullback to TQ. Strictly 
speaking we should write Xy (L)t*9'^ for the Cartan 1-form. Bearing this in mind it is 
easy to see that for any 1-form and vector field Z on Q, 

e{z'') = 9{z), e{z^) = o, z^jde = zjde, z^jd9 = o. 

In the case of the anholonomic frame and its dual, the third of these leads to 

J de^ = Xij do^ = Cx.e'J - d{e\Xi)) = -b'j^. 

The equation Taoj = —dE is of course an equation between 1-forms on TQ. Evaluating 
it on Xy and Xf in turn leads to the pair of equations 

xnx]{L)) {e\T) - v^) = 0, r{x-{L)) - xm = o. 

Prom the first of these, ^•' (r) = when L is regular, which says that F is a second-order 
differential equation field; and then the second gives the Euler-Lagrange equations rel- 
ative to the frame. But the equation Tjui = —dE stands even when the Lagrangian 
is not regular; and in particular any Euler-Lagrange field F satisfies the equations 
T{Xy{L)) — X^{L) = 0, though these equations may not be enough to determine an 
Euler-Lagrange field uniquely. 

3 Nonholonomic dynamics 

We now consider a dynamical system subject to nonholonomic linear constraints. There 
are two (equivalent) ways of specifying such constraints: as a distribution T> on Q (the 
constraint distribution), such that at each q & Q, Vg has the same dimension m, and 
which is not integrable (in the sense of Frobenius); or as a submanifold C of TQ (the 
constraint submanifold) which intersects each fibre in a linear subspace Cq, again of 
constant dimension m; of course Cg and Vg are the same, just viewed from slightly 
different perspectives. 
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We can choose an anholonomic frame {Xi} adapted to this situation by taking its first 
m members, say {Xq}, a = 1, 2, . . . , m, to span D. We write the remaining members of 
the frame as {Xa} , a = m+l,m + 2,...,n. Let be the corresponding quasi- velocities: 
then C is the submanifold where v"- = 0. 

We shall be interested in vector fields defined only on the constraint submanifold C. 
A vector field F on C (which for the purpose of the following definition could be any 
submanifold of TQ) will be said to be of second-order type if it satisfies the defining 
condition for second-order differential equation fields, r^^^ ^^F = u, for all (q, u) € C. 
We shall furthermore be interested in vector fields not only defined on the constraint 
submanifold C, but also tangent to it. A vector field F on C will be tangent to C if and 
only if V[v°') = 0, a = m-|-l,m-|-2, ...,n. A vector field F on C which is of second-order 
type takes the form 

F = v'^x'^ + rx^ 

for some functions F* on C Now 

XSK) = -i2>^ = -KXonC, 

so v°^X^{v°') = since R^^ is skew-symmetric in its lower indices. But XJ{v°') = 5?, so 
for F to be tangent to C we must have F" = 0. That is to say, a vector field F on C which 
is of second-order type and which is tangent to C takes the form 

F = + V^Xl 

with respect to the lifted anholonomic frame. 

It is interesting to note that F, in the form just derived, depends only on those vector 
fields of the frame which span T>. Let us derive its representation with respect to another 
frame {5^}, where again the span T>. Then 

Y^ = AiXp, Ya = A\X^ + AlX^, 

where the square matrices (^a) and (^a), whose entries are local functions on Q, are 
non-singular. The quasi-velocities uf" corresponding to the new frame are given by 

where the overbar indicates the inverse matrix. Note that the level sets = and 
■u;" = coincide, and that where = 0, u;"" = A^v^ . After a short calculation we find 
that 

F = v'^Xl + V'^Xl = + A'liT'^ - A^w'^)Y^. 

This shows that indeed any vector field of second-order type on the constraint subman- 
ifold C which is tangent to C can be expressed entirely in terms of a local basis for D, 
and incidentally gives the transformation rule for the coefficients F" under a change of 
such a basis. 
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We now suppose given a Lagrangian L on TQ. We say that L is regular with respect 
to the constraints if for any local basis {^a} of P, the symmetric m x m matrix whose 
entries are functions on C, is nonsingular (this condition is easily seen to be 

independent of the choice of basis). 

Proposition 2. LetC C TQ be the constraint submanifold for a system of nonholonomic 
linear constraints, T> the corresponding constraint distribution, and L a Lagrangian on 
TQ which is regular with respect to D. Then there is a unique vector field T on C which 
is of second-order type, is tangent to C, and is such that on C 

r(Z^(L)) - Z°{L) = 

for all Z ^T>. Moreover, T may be determined from the equations 

r(X^(L))-X^(L) = 0, a = l,2,...m, 

on C, where {Xa} is any local basis for V. 

Proof. For any L and F, the map Z ^ r(Z^(L)) — Z'^{L) is C°°((5)-linear, just as before. 
Thus r(Z^(L)) - ZC(L) = if and only if T{X^{L)) - X^{L) = for a local basis 
But r must take the form 

r = «^ + r"x^, 

so that 

which determines by the regularity assumption. □ 

We regard the content of this proposition as the fundamental statement of regular non- 
holonomic dynamics; it is our version of the Lagrange-d'Alembert principle. The dis- 
played equations in the statement of the proposition are the fundamental equations of 
regular nonholonomic dynamics. We could express the main content of the proposition 
in terms of the map e : X(Q) C°°(TQ) introduced earlier by saying that e takes its 
values in V^, the annihilator of V. To be more precise: is a linear subbundle of T*Q, 
and £, which in general is a 1-form along the tangent bundle projection r, according to 
the proposition is a section of TqV^ (where tq '■ C ^ Q is the restriction of r to C). 

For the proposition we do not need a full anholonomic frame. What purpose might the 
remaining vectors {Xa} of such a frame serve, one might ask. Here are two uses for 
them. 

In many formulations of the equations of nonholonomic dynamics, multipliers appear 
(see |5j for example). Having determined F by the method of the proposition, we may 
then form the expressions F(XJ(L)) — X^{L). These will not be zero: let us set 

T{X:iL))-X^iL) = Xa. 
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We have Xi = X^-d/dq^ with {X^) nonsingular. Then the coordinate version of the full 
set of equations reads 

fdL\ dL _ 

where X^ are the appropriate entries in the matrix inverse to (Xj). In fact v'^ = Xfu^, 
which explains the significance of these coefficients: the constraint equations are just 
X?it* = 0. We emphasise that the multipliers Xa are determined once one has chosen 
a full anholonomic frame adapted to the constraint distribution and found T from the 
fundamental equations. In fact the multipliers are the components of e with respect to 
the coframe dual to the (full) frame; the notional components with subscript a vanish 
because e takes its values in V^. 

The fundamental equations T{X^{L)) — X^(L) = involve differentiations of L in direc- 
tions transverse to C. Actually X^iv"') = 0, so the restriction of X^ to C is tangent to it. 
But as we pointed out earlier, X2{v°') = —W^/^v^ on C. Now is the component of 
[Xo, X^] along Xa, and in the case of genuinely nonholonomic constraints some of these 
components will be nonzero; so we must expect that X^ will not be tangent to C. Using 
the Xa we can split X^ into a component tangent to C, which we denote by X^ (the 
notation is not intended to imply that this is a complete lift, nor that an inverse matrix 
is involved), and a component transverse to it. Since Xa{v^) = Sa, we see that 

is tangent to C. Since v^'X^ = v°'X2, the vector field T is in fact of the form T = 
v'^X^ + r"X^. We may therefore write the fundamental equations as 

r(X^(L)) - X^{L) = -RlpV^XliL)- 

now every term on the left-hand side depends only on the value of L on C Let us denote 
the restriction of L to C by Lc, this is often called the constrained Lagrangian. Then the 
version of the fundamental equations above may be written 

nXl{L,)) - X^{L,) = -KpV^X-aiL). 

The equations appear in a somewhat similar form to this in fT9^ (Equation (1.6)). 

Finally, we can easily write the fundamental equations in Hamel form: they are 

dL \ . dL gdL ^ 

(compare with [S] Theorem 3.2). 



4 Vakonomic systems 

The term 'vakonomic mechanics' was introduced by Arnold and Kozlov in [1] |25| and 
stands for 'mechanics of Variational Axiomatic Kind'. The theory was proposed as an 
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alternative to the approach to nonholonomic dynamics discussed in the previous section. 
A fundamental reference in this context is the book chapter [33] by Vershik and Ger- 
shkovich. Since more and more evidence seems to suggest that the vakonomic equations 
do not give the correct equations of motion, some authors have refrained from using the 
term vakonomic mechanics and prefer to call these systems 'variational nonholonomic 
systems' (see e.g. [2 ). We shall follow the majority in using the word 'vakonomic', but 
avoid controversy by talking of 'vakonomic systems', 'vakonomic problems' and so on, 
but never mentioning 'vakonomic mechanics'. 

One idea behind the formulation of vakonomic systems is to think of the multipliers as 
additional variables. Recall that the multipliers are in fact components of a 1-form e 
(along a certain projection) which takes its values in C T*Q, the annihilator bundle 
of the constraint distribution D. We therefore take T)^ as state space for the vakonomic 
system. The Lagrangian of the vakonomic system will therefore be a function L on TV^. 
It is constructed as follows. First, we are given a Lagrangian L on TQ; but TD^ projects 
onto TQ, so we can pull L back to TP" (and as before we denote the pulled-back function 
by the same symbol L). Secondly, every point /x of over g G Q is a covector at q, 
and so defines a linear function on TqQ; we can therefore define a function M on TV^ 
by M{q, ii,u,v) = //(u) (note that M is independent of the second fibre component v). 
Then 

L = L-M. 

In keeping with the philosophy of the rest of this paper, we now express L in terms of 
an anholonomic frame. We introduce an anholonomic frame {Xa,Xa} adapted to the 
constraint distribution as before. By doing so we effectively identify with Q x M"""*, 
or in other words we fix fibre coordinates jjLa on P", which are the components of /x 
with respect to the coframe dual to the chosen frame. This identification implies a local 
character of some of our results. For a more intrinsic formulation, see e.g. |10| [33] . The 
Lagrangian L is given as a function on T{Q x M"^™) by 

L = L- HaV" 

where the v"" are quasi-velocities, as before. 

We assume that L is regular. But even so L fails decisively to be regular, so we cannot 
obtain its Euler-Lagrange equations simply by applying Proposition [T] We shall instead 
use the method of the Cartan form, as outlined in Section 2.3, to obtain them. 

We may augment our frame {Xi\ on Q to a frame on Q ^ M"""* simply by adjoining 
the coordinate fields d/d^a- The vector fields Xj, now interpreted as vector fields on 
Q X M""*", act in the same way as before on the coordinates of Q and have the property 
that Xi{^a) = 0. Since L does not depend on the velocity variables corresponding to the 
Ha its Cartan 1-form is 

xnL)e' = xnL)e'-iJ^aO''. 

For the same reason, E = E (the energy of L). The equation Taui = —dE, which is an 
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equation for a vector field or fields f on T{Q x M" becomes 

f J {uj - dinaO")) = -dE, 

where uj is the Cartan 2-form of L (properly speaking, pulled back to T(Q x M"'"'")). By 
evaluating the equation on Xy we obtain 

whence ^■'(r) = since we assume that L is regular. The coefficient of the term in d/ia 
must vanish, whence 9"'{T) = = 1;": that is to say, the equation Tju = —dE has no 
solution except where v"^ = 0, that is, except on C x TM"""*. Finally, evaluating the 
equation on and X^ successively gives 

f(X-(L))-XS(L) = /.„i?«y, 

f (XJ(L)) - X^{L) = ^l,Rl^v^ + f (Ma). 

We shall sometimes combine these two sets of equations into one by writing 

f(X7(L)) - Xf (L) = fiaRtaV- + t{l,a)St. 

Now f is a vector field on C x TM"""' C T{Q x M""""). It is natural to decompose it 
according to the product structure, say F = Fc + r^. The equation displayed above then 

becomes 

Tc{Xr{L)) - Xf{L) = i^aRfaV'' + ^MSf- 

We may construct Euler-Lagrange fields F = Fc + F^ for the vakonomic problem as 
follows: F^ is completely undetermined; but a choice of F^ having been made, Tc will in 
favourable circumstances be determined by the equation above. 

Since the velocity variables corresponding to the fia do not appear in the equations, and 
in fact play no part at all in the theory, we propose to ignore them; that is to say, we 
shall replace T{Q x M'^"'") by TQ x M""™ and C x TM"-" by C x M""™. The frame 
{Xi,d/dfia} can be lifted to a frame {Xf,Xy,d/diJ,a} (and in principle also d/dua, but 
we ignore these in view of what was said above). In this frame, the vector fields X^, 
for example, should be interpreted as vector fields on TQ x M""™": again, they act in 
the same way as before on the coordinate functions x^,v^ and have the property that 

^f(Ma)=0. 

Furthermore, the fact that the Ha are variables is really rather an embarrassment; we 
would prefer to think of them as functions on C, or in other words to take a section (f) of 
the projection C x M""""* — C and restrict our attention to its image im(^). 

Any Euler-Lagrange field F, restricted to im((^), will take the form F = Fc + F^ where 
since Tc is of second-order type 

Fc = «S + F"X^ + F"Xj, and F^ = ^„-^; 
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all coefficients are functions on C. Notice that though Tc is here defined on C, there is 
no reason to believe that it is tangent to it; hence the inclusion of the term in X^. 

We may conclude: 

Proposition 3. // a vakonomic Euler-Lagrange field T is decomposed as above, the 
restriction of the vakonomic Euler-Lagrange equations to im(^) may be written 

Tc{Xl{L)) - = <t>aKpV^ 

Vc{XZ{L)) - X^{L) = hR^v'^ + Aa. 

We shall always use the vakonomic equations in this form. 

The first point to note is that provided L is regular, so that the Hessian of L is non- 
singular, for given (j) and Aa these equations determine Tq as a vector field on C C TQ. 
We next show that when further regularity conditions are satisfied we can choose Aa 
such that Vc is not just a vector field on C but a vector field tangent to C: that is, we 
can choose Aa such that = 0. 

For convenience we denote the components of the Hessian of L with respect to the frame 
{Xi} by gij-. 

Xl{X}{L)) = g^^, Xl{X-{L)) = g^a, Xl{X^{L)) = g^,. 

The gij are symmetric in their indices. In discussing the Lagrange-d'Alembert principle 
we imposed a regularity condition on L, namely that the submatrix {gap) of its Hessian 
must be nonsingular on C: we said that L is then regular with respect to P. Assuming 
that L is indeed regular with respect to D, let (17"^) be the matrix inverse to {gap)- In 
the following argument we must assume that {gab ~ d"'^ 9aa9bp) is nonsingular on C. This 
matrix is in fact the restriction of the Hessian {gij{q,u)) to the subspace of TqQ which 
is orthogonal to T>q with respect to it, so when this matrix is nonsingular we say that 
L is regular with respect to T)-^. When {gij) is positive definite, {gab — g"'^ gaagb/s) will 
automatically be nonsingular, as will {gap) be; indeed, both will be positive definite. So 
when {gij) is positive definite L will automatically be regular with respect to both V and 
V-^, but for Hessians of other signatures we need to make the assumptions explicit. 

The equations for Tc can be written 

gapTf^ + 9abT'' = Ya, 
gapTf^ + gab^'' = Ya + Aa, 

where Yq, and Ya are known expressions in L, the Xi, (j), the R*^-, etc. Using the assump- 
tion that gap is nonsingular we can eliminate F^ between these equations, leaving 

{9ab - 9''^9aa9bp)r^ = Aa + Ya - gaa9''%- 

So when {gah - g"^gaa9bp) is nonsingular, by taking Aa = gaa9°'^Yp - Ya we ensure that 
F" = as required. 
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The resultant vector field Tc, which is the projection onto C of the restriction of f to 
im[(f)), is determined by the equations 

rc(X-(L))-X«(L) = </,„i?«X. 

Of course, it will also satisfy 

TciX^iL)) - X^iL) = cP^Rl^v- + Aa 
(with Aa = gaa9°''^Y/3 - Ya); that is to say, 

f = Tc + = v'^x^ + r"x^ + Aa^ 

will satisfy the vakonomic equations on im(^). 

Let us set rc(Xj(L)) - X^^L) = Aa. Then A^ = Aa - (t>bRLv"- 

Now r, so determined, is a vector field on im((/>). A natural question to ask is whether 
it is tangent to im{(j)), that is, whether r(//a — 4>a) = for //„ = But r(/Xa — 0a) = 
Aa — Tc{(l)a)- The condition for tangency is therefore 

rcicj>a) + 4>bRLv''=^a. 

We can summarise this discussion as follows. 

Proposition 4. Assume that L is regular with respect to both D and . Let (p : jJLa — (pa 

he a section of C x W^~'^ C. There is a unique vector field T on the image of (f) which 

is a solution of the vakonomic problem there, and is such that its projection onto C, Tq, 
is tangent to C. The vector field T so determined is tangent to the image of (p if and only 

ifTc{4>a)+<t>bRlaV'' =K. 

One might very well query the importance of the requirement of tangency to im(0): but 
consider the following. 

Suppose that the (j)a can be continued ofFC, that is, suppose that there are functions 
defined in a neighbourhood of C in TQ such that (pa = ^a\c- This can always be done 
locally: since C is given by t;" = it is enough to make independent of the v"". Suppose 
that the Aa have been chosen so that Vc is tangent to C; then Vc{*^a) = ^c{<Pa)- Consider 
the Lagrangian L = L — ^av"'j which is, note, a function on some neighbourhood of C in 
TQ. We have 

X^iL) = XUL) - XU^aK - ^a5f , 

Xf{L) = Xf{L) - + ^aR%vi, 

whence on C 

Vc{Xl{L)) - X^{L) = rc(X^(L)) - X^{L) - <PaRlpvP = 0, 
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while 

Thus the tangency condition is the necessary and sufficient condition for Tq to satisfy 
the Euler-Lagrange equations of L on C (and this for any extension of the (j)a off C). That 
is to say: 

Theorem 1. If there is a section (f) such that T = Tq + T ^ is tangent to the image of 
(j), where is chosen such that Tq is tangent to C, then Tq is the restriction to C of 
an Euler-Lagrange field of L for any extension of the (pa off C. Conversely, let T be an 
Euler-Lagrange field of L = L — ^av'^,' suppose that T is tangent to C. Let (pa = ^a\c ^n-d 
consider the section (p : Ha = (pa- Then (p^:T^, where f is the restriction ofT to C, is a 
solution of the vakonomic problem on the image of (p. 



5 The consistency problem 



We now turn to the vexed question of whether the nonholonomic and vakonomic problems 
can ever be in any sense consistent. As we have shown, the vakonomic problem leads to a 
whole class of dynamical vector fields T defined on C x M"^*" but not necessarily tangent 
to it, the Lagrange-d'Alembert principle to a single vector field T defined on C and 
tangent to it. The question of how one decides whether there can be any coincidence 
between such vector fields is therefore somewhat puzzling. We propose the following 
answer, under the assumption that L is regular with respect to both V and V^: we say 
that the vakonomic and the nonholonomic problems are weakly consistent if there is a 
section (p of the projection C x M""™ — > C such that the corresponding vector field Tq 
defined in Proposition |4] is the nonholonomic dynamical vector field T. (We shall propose 
a stronger criterion for consistency shortly.) 

Theorem 2. The following three statements are equivalent. 



(i) The variational nonholonomic problem and the nonholonomic dynamics are weakly 
consistent. 

(a) There is a section (p of the projection C x M"-"™- C such that (paR'^^v^ = 0. 

(Hi) There is a section (p of the projection C x M""™ C such that the vector field T 
on im((p) given by 

f = r + {Xa-(PbRLvn^, 

where the Xa are the multipliers for T, satisfies the vakonomic equations on im(0). 
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Proof. As we pointed out earlier, Tc is determined by the equation 

rc(X^(L))-XS(L) = «X 

on C. Thus Tq will coincide with T if and only if (paR'^f^v^ = 0. 
On the other hand f satisfies 

f{X:{L))-X^{L)=0, 

on im((/)). Comparing with the vakonomic equations on im(0), namely 

fiX:iL))-X^{L)=^aK^v^, 

t{X:{L)) - X^{L) = ^kKaV" + r(Ma), 

we see that T satisfies the vakonomic equations if and only if (paR'^f^v^ = 0. □ 

Theorem |2] shows that there is a process leading from the nonholonomic dynamics T to 
a vakonomic field f and that therefore weak consistency works in both directions. In 
fact the construction leading to T can be applied to any vector field of second-order type 
tangent to C, with the obvious choice of 'multipliers' A^; if it is applied to Tq we get back 
f . Conversely, if we apply Proposition [4] to T we obtain T . 

The condition in item [ii) of the theorem is, of course, not new. It can also be found in 
other texts such as e.g. [101 US] a-iid it dates back to at least the paper [31] of Rumiantsev. 
It can, however, always be satisfied: one merely has to take (pa = 0, that is, choose the 
zero section. The corresponding vakonomic field f then has a particularly attractive 
form: 

f = r + Aa-^. 

The fact that weak consistency always holds explains why we regard it as weak, and why 
we propose the following stronger version. We say that the nonholonomic and vakonomic 
problems are strongly consistent if the problems are weakly consistent on im(0) for some 
section (p, and if in addition the vakonomic field T is tangent to \m.{(p), or in other words 
(/)*r is vakonomic. 

Corollary 1. The necessary and sufficient condition for the nonholonomic and vako- 
nomic problems to he strongly consistent is the existence of a section tp of the projection 
C X M"-™ ^ C such that (paR^pv'^ = and T{(pa) + 4>bRlaV'^ = ^a- 

Corollary 2. If the nonholonomic and vakonomic problems are strongly consistent, T is 
the restriction to C of an Euler- Lagrange field of a Lagrangian L. 

Proof. Apply Theorem [T] □ 
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It seems to us that Corollary 2 provides what is probably the most interesting con- 
sequence of our analysis of the consistency of nonholonomic and vakonomic problems, 
namely that strong consistency is a sufficient condition for the nonholonomic dynamics 
to be the restriction of an Euler-Lagrange field. 

The introduction of L is the process called by Fernandez and Bloch |19] , in the context 
of Abelian Chaplygin systems, 'the elimination of the multipliers'. To obtain their result 
these authors assume what they call 'conditional variationality', which in that context 
corresponds to our strong consistency. Corollary [2] above is a substantial generalization of 
their result. It is worth emphasising that Theorem [T] itself , which is the major ingredient 
of the corollary, is quite independent of the question of consistency and is concerned just 
with the vakonomic problem. 

Strong consistency has the following consequences so far as individual motions are con- 
cerned. Let 7 be any individual integral curve of T. Let ip^ be such that 

V'°(i?V)(7(0)) = 0. 

The equations 

where all of the variables other than ipa are evaluated at 7(t), have a unique solution. 
Then (7(i), is a solution curve of the vakonomic problem if and only if ipa{t)R^i^v^ = 
for all t. If strong consistency holds, with respect to a section /i^ = (pa, then ipait) = 
4'ai'yit)) satisfies this condition for every 7. On the other hand, it would be possible to 
formulate a partial version of consistency, in which this condition holds along some, but 
not necessarily all, integral curves of F. A definition along these lines is to be found in 
|19| . It seems likely that the conditions for partial consistency case are related to the 
so-called second-order constraints in the algorithm proposed in [10]. 

6 Chaplygin systems 

We now discuss the results of the previous section as they apply to (non- Abelian) Chap- 
lygin systems, which we define immediately below. 

Assume that a Lie group G acts in a free and proper way on the configuration manifold 
Q. Then Q — > Q/G is a principal fibre bundle. A Chaplygin system sometimes 
referred to as a generalized Chaplygin system [6] or as a nonholomic system of 'purely 
kinematic' or 'principal' type |21|3], is a nonholonomic system where the Lagrangian is 
invariant under the induced action of G on TQ, and moreover the constraint distribution 
V is the horizontal distribution of a principal connection on the bundle. For more details 
about principal bundles and principal connections, see e.g. ^j. 

The most natural choice for a frame for a Chaplygin system is one where the vector fields 
Xa = Ea are the fundamental vector fields of the action of the group G and the vector 
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fields Xa span V and in addition are invariant under G. Thus [Ea, Ei,] = —C^^Ec where 
the coefficients are the structure constants of the Lie algebra g of G, and [iJ^^-^a] = 0. 
We have Rl^^ = and R^^jj = —5],G'^^. The formula for the action of a complete lift on a 
quasi-velocity gives 

In particular, E"^ is tangent to C: the group action leaves C invariant. 

The nonholonomic dynamical vector field V obtained from the Lagrange-d'Alembert prin- 
ciple is determined by the fundamental equations 

T{Xl{L)) - X^{L) = 0. 

These may be written in the alternative form 

r(XV(Le)) - = -Rlpvl'EZiL) = -Rlf^vPpa 

where Pa = E^{L) is just the ath component of momentum for the free Lagrangian L 
corresponding to the action of G as a symmetry group of L. The component X^ of X^ 
tangent to C is given in this case by 

Note that X^{L) = X^{L) + R^^vl^Pa. 
Since £^^(L) = 0, the multiplier equation is 

r(pa) = 

the momentum is not conserved by the nonholonomic dynamics, therefore, but it is 
related to the multipliers in a simple fashion. Due to the invariance of L, we have 
E^M = [ES,mL) = -Cl,mL), or 

which says in fact that the momentum, regarded as a 0*-valued function on TQ, trans- 
forms according to the coadjoint action of G. By acting with E'^ on the fundamental 
equations one finds that [i?^, r](X^(L)) = 0. But as we pointed out in Section 2, [-E^jr] 
(the bracket of a complete lift and a vector field of second-order type) is vertical. It is 
also tangent to C. Provided that L is regular with respect to V it follows that [i?^, F] = 0, 
which is to say that the dynamical vector field V is invariant under the action of G on 
C. It follows immediately that 

El{\,) + Gl,X, = Q, 

which says that the multipliers should also be seen as components of a 0*-valued function 

transforming under the coadjoint action. In fact for q € Q the map g — > TqQ by ^ 
identifies g with a subspace of TqQ complementary to Vg (namely the subspace tangent 
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to the fibre). Under this identification P^, the annihilator subsapce of Vq in T*Q, can 
be identified with g*. 

Consider now the vakonomic problem. The Euler-Lagrange equations for a vakonomic 
field f on C X M""™ C TQ x M""™, namely 

t{XUL)) - Xf[L) = fJ.aR^^v'^ + t{l^a)St, 

in this case become 

f(X^(L))-XS(L)=/.,i?>^ 
f(4-(L))=f 

The latter just says that Pa — l^^a, a function on TQ x M""™, is a constant of motion 
for every vakonomic dynamical vector field F, or in other words that every T is tangent 
to the level sets of Pa — fJ-a- Since T is defined only for v"" = 0, we must in fact restrict 
these functions to C x M"^™, and make the more comprehensive statement that every T 
is tangent to the level sets of — /^a in C x M""*". 

We observed in the previous section that it is preferable to restrict the Euler-Lagrange 
equations of the vakonomic problem to the image of some section </) of C x M""*" — > C. We 
have an obvious choice of section in the present case, namely Ha = Pa, the zero level set of 
Pa — fJ-a- From the previous paragraph we see that every vakonomic field T is tangent to 
im((j)). It follows that with this choice of section, there is no difference between weak and 
strong consistency for a Chaplygin system. In fact the second condition of Corollary [T] 
for strong consistency, namely T{(j)a) + (pbRaa'^"' = A^, reduces to r((/)a) = Xa with our 
choice of frame, and is satisfied automatically when (pa = Pa- Indeed, it is satisfied for 
4'a = Pa + ka if T(ka) = 0, that is, if ka is a constant of motion for the nonholonomic 
dynamics. We work with the section (pa = Pa here; in the next section, however, we shall 
show that for the two- wheeled carriage example the consistency conditions can sometimes 
be satisfied with non-zero ka- 

We now consider the situation in the light of Theorem [TJ This concerns the Euler- 
Lagrange equations of the Lagrangian L = L — ^av"" where is any extension of (pa 
off C. In the present case (with (pa = Pa) there is no difficulty in extending (pa off C: 
(pa is just the restriction to C of pa, a well-defined function on TQ. We can therefore 
take L = L — pav"". We shall discuss the dynamics of this Lagrangian in the next few 
paragraphs. 

We need to impose another regularity condition on L. The components of the Hessian 
are denoted as follows: 

X^(Xj(L))=5a/3, X^{E:iL))=gaa, KiE^L)) = gab- 

We now require that the submatrix (gab) of the Hessian is nonsingular. When this holds 
we say that L is regular with respect to Q. 

We consider the Lagrangian L = L — PaV"" on TQ. 
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Proposition 5. If the Lagrangian L of a Chaplygin system is regular with respect to D 
and to Q, the function L = L — pav"" is regular in some neighbourhood of C and invariant 
under the action of G. Moreover, the corresponding Euler- Lagrange field T is tangent to 
C. 

Proof. A short calculation leads to the following expressions for the components of the 
Hessian of L, where E stands for the vector field v'^E^ (so that L = L — E{L)): 

Xl[X}[L))=g^p-E{g^p), 

Xl{El{L)) = -E{g^a), 

EZ{El{L)) = -gab-E{gab). 

Note that = on C, from which it follows that if L is regular with respect to T) and 
to Q then L is regular on C, and therefore regular in some neighbourhood of C at least. 
In such a neighbourhood of C there is a unique second-order differential equation field T 
which satisfies T{Xy{L)) — Xf{L) = 0, the Euler-Lagrange equations of L. 

Now L (as well as L) is invariant under the action of G on TQ : since E^ {pb) + G^^^pc = 0, 

E^{L) = E^iL - pbv') = -{E^ipb) + PcC:,)v' = 0. 

The momentum pa = E^{L) will therefore be conserved by the dynamical field f , or in 
other words T will be tangent to the level sets of pa- But 

Pa = El{L - PaV"-) =Pa- gabV^ - Pa = -Qahv'' ■ 

Thus if L is regular with respect to 0, the zero level oipa is precisely C. We can conclude 
therefore that the Euler-Lagrange field F of L is tangent to C □ 

This proposition extends a result given in [19] for the case of an Abelian Chaplygin 
system to Chaplygin systems in general. 

Corollary 3. Under the assumptions of the previous proposition, the vector field (/>*r'', 
where is the restriction ofT to C, is a solution of the vakonomic problem on the image 
ofct>. 

Proof. This follows from Theorem [ij □ 

The equations determining f are now to be compared with those for the nonholonomic 
dynamics. 

Proposition 6. Under the assumptions of the previous proposition, the restriction 
to C of the Euler-Lagrange field of L equals the nonholonomic field T if and only if 

or, equivalently, if and only if the conditions for weak (and thus strong) consistency are 
satisfied. 
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Proof. The equations determining T are of course the restrictions to C of the Euler- 
Lagrange equations T{Xy{L)) — Xf'{L) = 0. Those with i = a just say that is tangent 
to C. The others may be written 

fO(X^(Z,)) - X^L,) = -K^v^EliL). 

Here Lc is the restriction of L to C: but this is clearly L,.. Now E^{L) = Pa = —Qah^^ = 
on C. So r*^ satisfies 

fO(X^(L,))-X,^(L,)=0. 

Since by assumption L is regular with respect to P, these equations uniquely determine 
r°, which is of second-order type. 

On the other hand, the equations determining the nonholonomic dynamics are 

T{Xl{L,))-X^M = -KpV^Pa. 

We see that the two agree if and only if R'^pV^Pa = 0, which is of course the condition for 
weak consistency, and in fact as we pointed out earlier for strong consistency as well. □ 

This extends further results of [19] on Abelian Chaplygin systems to Chaplygin systems 
in general. 

We pointed out above that (^*r^ provides us with a solution to the vakonomic prob- 
lem, whether or not consistency holds. There are further points of interest about the 
vakonomic problem in this special case which we now discuss. 

First, we can extend the action of G from Q to TD^ in a manner modelled on the 
action on the multipliers discussed earlier, as follows. Recall that at the beginning of 
the previous section we said that the /x^ are really the components of a covector ^ in 
pO ^ T*Q with respect to the coframe that is dual to the chosen frame {XcE^q}. We 
can also interpret therefore as a function on T*Q. In fact each vector field X on Q 
defines a linear function = l^iX^ on r*Q, and //q is the function that corresponds to 
the vector field Ea- On the other hand, each vector field W onQ can be lifted to a vector 
field VF^^^ on T*Q (another complete lift: see [15]) and the relation W^'^\^x) = ^^{[w,x]) 
holds. If we take W = Ea and X = Eb, we get 

Ei^\^^b) = -C^abf^c 

Now the action of G on Q extends to an action on Qxg* = T)° and therefore also induces 
an action on TT)° . The infinitesimal generators of this last action are simply the vector 
fields 

The extended Lagrangian L = L — fLav"", which is a function on TV^, is invariant under 
this action: 



23 



The corresponding momentum components are just pa = Pa—f^a (so far as this calculation 
is concerned, /la is a coordinate on the base). We thus obtain by a different method a 
result that we pointed out earlier, namely that every Euler-Lagrange field of L is tangent 
to the level sets of the functions pa- The zero level set of pa, namely fj-a = Pa, is of course 
the section </> of C x M"^™" C we are using. Not only is T tangent to this, so also is Ca- 

ea{Pb - l^b) = -CabiPc - l^c)- 
The Euler-Lagrange equations for the vakonomic problem may be written 

t{X:{L))-X^{L)=PaKpV^, tiPa)=0. 

Set f = (/)=,f °. Then since fO(X^(L)) - X^{L) = and f ° is tangent to the zero level 
of Pa, we have 

r^{x^{L))-x^{L) = o, r\Pa) = o, 

which shows explicitly that satisfies the Euler-Lagrange equations of L on im{(f)). 

It may not have escaped the notice of the reader that L is actually the Routhian of L 
(see im |27] ) . This does not seem to be of any significance — we are not interested in the 
Euler-Lagrange field of L or the level sets of pa- However, there are Routhian overtones 
to the story. In the first place, we may consider the Routhian of L itself. This is given 

by 

L - paV"" = L- PaV" + QabV^V^. 

(Observe that if L = T—V is of mechanical type with gaa = 0, this Routhian is simply L.) 
Its restriction to the level set = is simply the constrained Lagrangian Lc : C ^ M.. 
Furthermore, the so-called Routh procedure, as described in [11], gives an alternative 
way of deriving the equations for , which are in fact the generalized Routh equations 
for the Routhian of L on = 0. 

Secondly, we may compute the Routhian of L, which is 

L - paV" = L- HaV"^ - {pa - Ha)v'' = L- PaV" = L. 

This is a function on TQ, in other words it is independent of p,a, which itself is unusual 
and interesting. Once again one can use the Routh procedure to derive the equations for 
r restricted to any level set of pa- These will be expressed in terms of L, which reveals 
again the close relationship between this Lagrangian and the vakonomic problem. 

7 Examples 

7.1 A class of nonholonomic systems with constraints of a special form 

As a first application we look at systems on R'^+^, with coordinates {qi, q2,qa) and corre- 
sponding natural fibre coordinates {ui,U2, Ua), a = 3,4, . . . ,k + 2, where the constraints 
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take the rather special form 

Ua + Aa{qi)u2 = 

for some functions A^, and where the Lagrangian is of the type of a EucHdean metric 

L = l (^huj + hul + laul^ . 

(We shall not use Einstein's convention for sums over the index a in this section.) This 
class of nonholonomic systems has been studied in [1] , where the authors investigated, 
using methods different from ours, whether there is a Lagrangian whose Euler-Lagrange 
field coincides with the nonholonomic dynamics when restricted to the constraint sub- 
manifold. Evidently our Corollary 2 is relevant to this question. The class includes 
important classical examples such as the nonholonomic particle, the vertically rolling 
disk, the knife edge on a horizontal plane, the mobile robot with a fixed orientation, etc. 

The frame 

Ijl a _ va —1 

\dqi' dq2 ^ "'dqaj 

spans D; it can be completed to a total frame by adding the vector fields d/dqa- 
These last vector fields are in fact the infinitesimal generators of the M*^-action given 
by ((^a), {qi,q2,qa)) {qiiq2,qa + £a), under which both the Lagrangian and the con- 
straints are invariant (they are even invariant under an M'^'^-'^-action). The systems are 
therefore of Chaplygin type. We write 

= 7^ — , X2 = TT > AaT^ ) Xa = Ea = 7^ . 

oqi oq2 ^ oqa oqa 

In terms of the corresponding quasi-velocities Vi the restriction of the Lagrangian to the 
constraint submanifold C : Va = is just 



The ideal candidate for a section to check strong consistency is given by (j)a = Pa = 
X^(L) = —IaAaV2 on C (no sum over a here). Since the only non-vanishing bracket of 
vector fields in the above frame is 

[Xi,X2] = -Y,KXa, 
a 

the condition (f>aRaf3'^^ — becomes 

^A„A'„^ vl = and la^aA'^j viV2 = 
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on C. Given that, in general, ^1,^2 / 0, this condition is simply ^^/aAaA'^ = 0. This 
last equation is the condition for the function I2 + X^a-^a^a; the coefficient of in L^, 
to be constant; that is to say, it is the condition for Lc to have constant coefficients 
when expressed in terms of the quasi- velocities. The condition has another geometric 
interpretation. The function 



is the invariant measure density of the above nonholonomic system (see |3] ) ; our condition 
is therefore the condition for the system to admit a constant invariant measure density. 

From Corollary [2] we can conclude that if is a constant, the nonholonomic field T is 
the restriction to C of an Euler-Lagrange field of the Lagrangian L = L — ^^Po.'^o,^ which 
turns out to be 



2 (^hv! + N-\l-Y,Iavl^ 



In terms of the original variables we have 

L = 5 huj + hul - ^ laul - ^ AaIaUaU2. 
\ a J a 

This is exactly the statement (in the current terminology) in Proposition 2 of 

Let us come to some details for a few specific examples. The Lagrangian of the nonholo- 
nomic particle is L = + U2 + u|) and its constraint is + qiU2 = 0. One easily 
verifies that the nonholonomic vector field is given by 

(For this example, it so happens that there is no term in .) The vakonomic fields F 
take the form 

f = 771 + V2X^ 

d 

- iiqiXX + ifivi + qiA)X^ + ((1 + qj)A + V1V2 - fiqiVi) X^ + A—, 

where A is arbitrary. The field for which Fc is tangent to C (so that the coefficient of 
X^ vanishes) is the one where 

^ _ Mivi - V1V2 

i + qf 

Obviously, the invariant measure density is not a constant, so there is no strong con- 
sistency on /X = p. On the other hand, Fc and F evidently coincide for /i = 0, which 
illustrates our observation that weak consistency always holds. 
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The vertically rolling disk also belongs to the class. Let R be the radius of the disk. If the 
triple (x, y,z = R) stands for the coordinates of its centre of mass, f for its angle with 
the (x, z)-plane and 9 for the angle of a fixed line on the disk and a vertical line, then 
the nonholonomic constraints are of the form Ux = {Rcosip)ug and Uy = {Rs'mip)ug. 
The Lagrangian of the disk is L = \M{u^ + u^) + \lul + ^Ju^, where / and J are 
the moments of inertia and M is the total mass of the disk. The identification with the 
notations above is {qi,q2',Qa) = {'f,9]x,y). It is easy to see that is a constant here 
and that the nonholonomic and the vakonomic equations are consistent. For a detailed 
calculation of those equations, see e.g. [2]. 

7.2 A two- wheeled carriage 

Consider a two-wheeled carriage which can move on a horizontal plane in the direction 
in which it points and which can spin around a vertical axis; the wheels roll without 
slipping over the plane. This object is sometimes called a planar mobile robot, but we 
do not use this description since we think that the word 'robot' should be reserved for 
devices that are subject to controls of some kind. 




X 



The angle of rotation about the vertical is denoted by 9, and the positions of the wheels 
are characterized by angles V'l and 1^2- We shall generally assume that the intersection 
point (x, y) of the horizontal symmetry axis of the carriage with the line which connects 
the two wheels is not necessarily the centre of mass of the system, but lies at a distance 
i from it. 

The configuration space of the system is S^xS^xSE{2), with coordinates {ipi,tp2, x, y, 9). 
The Lagrangian is 

L = lm{ul + Uy) + moiug {{cos 9) Uy - {sm9)ux) + \ Ju0^ + \ J2{u^^^ + n^^) 

where uiq is the mass of the body, m = tjiq + 2mi is the mass of the complete system, J 
is the moment of inertia of the whole system around a vertical axis through (x, y) and 
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J2 is the axial moment of inertia for the wheel. The constraints are 

R R R 

Ux = -— cos 9{ujp^ + Ujp,^), Uy = -— sin 6{u^^ + u^^), uq = —{u^^ - u^^) . 

Here R is the radius of a wheel, and c is half the length of the axle. This example, 
including the formulae for the Lagrangian and the constraints, can be found in the 
classic reference [30], and in many more texts, see e.g. |18l 1191 [22] (the case where £ = 0) 
and [61 Uni [13 m] (the general case). 

The distribution T) is spanned by the vector fields 

X - — -- (cose— + s\ne— + - — 
^ d-^Ji 2 \ dx dy cdO 

X -—--(cose—+sme—-- — 
^ f?V'2 2 \ dx dy cd6 

The Lagrangian and the constraints are both invariant under the (usual) S'S(2)-action 
and this is an example of a Chaplygin system. By adding the fundamental vector fields 

~ d ~ d ~ d d d 

Xs = Es = —, Xi = Ei = —, X5 = ^5 = — -y— +x — , 
ox oy ot) ox Oy 

we get a full frame. We have vi = u^^, V2 = u^2i ^'^'^ the constraints are simply 
^3 = ^4 = ^5 = 0, where 



u 



X — 2 



\Rcos6{vi + V2) +v^- yv5, 
^Rsm0{vi + V2) + f 4 + XV5, 



y ~ 2 
R 

ue = - — (^1 ~ "^2) + V5. 
2c 

We have written the quasi-velocities implicitly like this to make it easier to calculate the 
expression for L in terms of them. We don't in fact need the full expression, so we shan't 
write it down. First, we give the expression for Lc- 



-^'^ ^ 5 ( ^("^ + + "^2^ {vl + vl) - - mc^)viV2. 

Note that the coefficients are constants. So far as the full Lagrangian is concerned, it 
will shortly become apparent that we need only those additional terms which are linear 
in v-^ and with coefficients involving only vi and V2'- we have 

L = Lc — |mi?(f 1 + V2){vs cos 6 + V4 sin 0) H — (f 1 — f2)('U3 sin 9 — V4 cos 9) + . . . 

We proceed to the calculation of the nonholonomic dynamical vector field. For this we 
need the bracket of X\ and X2: 

[Xi,X2] = ^ ((sin 0)^3 - (cos^)X4) , 
2c 
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from which we obtain the following formulae for derivatives of quasi- velocities: 

2c 2c 

X2{n) = —{sme)vi, x^{vi) = -—{cose)vi] 

2c 2c 

the other derivatives of quasi- velocities along X^, a = 1,2, are zero (in particular, those 
of Va)- The terms not written explicitly in the formula for L above will give zero when 
we calculate X^{L) and set = V4, = = 0. We have 

^1 (-^) = 4^^^^ ~ ^2)^^2, X2 [L) = [Vi - V2)vi. 

Recall that v^X'^ is tangent to C, and note that in this case v°'X'^{v^) = 0. For conve- 
nience we write 



R^ W' 
Lc=\P{v\^v\)-QvxV2, P = ^{J + mc^) + J2, Q = ^{J-mc'^^ 



With r = viX^ + ^2^2° + TiX^ + TaX^ we have 

T{X^{L) - (L) = PFi - QT2 + K{vi - V2)v2 = 
r(Xj(L) - X^{L) = -QTi + PT2 - K{vi - V2)vi = 0, 

where we have written K for ttiqIR^ / Ac? . Then by simple algebra 
K K 

^1 = p2 _ Q2 ~ '^2){QVI - PV2), T2 = p2 _ Q2 ~ V2){Pvi - QV2) ■ 

This agrees with the expression of the dynamics in [24 , and corrects an unfortunate 
misprint in [T7] . Notice that K = when £ = 0; in this case the nonholonomic equations 
say simply that in any motion vi and V2 are constants, that is, that the wheels rotate 
with constant (in general different) speeds. 

In view of the fact that i^fg = 0) to test for consistency we do not need to know p^. From 
the truncated formula for L given earlier, on C we have 



2cK 

P3 = -^'mR{vi + V2) cos 6 + ~ ^2) sin 6*, 



2cK 

P4 = -\mR{vi + V2) sin 6* ^2"('"i ~ "^^2) cos^. 

Thus 

R^ 

PaRi2 = -^{{sm9)p3 - (cos6')p4) = K{vi - V2). 
2c 

It follows that consistency holds (strongly, since this is a Chaplygin system) with (pa = Pa 
if and only if ^ = 0. (Actually, we can choose (p^ arbitrarily subject to the condition 

n^5) = A5.) 
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However, we are not restricted to taking (f)a = Pa- For a Chaplygin system, the second 
condition for strong consistency is r((/)a) = A^; this is certainly satisfied by (pa = Pa, but 
it is also satisfied by (pa = Pa + ka where ka is any constant of motion, T{ka) = 0. So 
we may enquire whether, when ^ / 0, there are constants of motion ^3 and such that 
{pa + ka)Ri2 = 0, that is, such that 

2cK 

Evidently, we should search for constants of motion which are linear in vi and V2 with 
coefficients linear in cos 6 and sin 9. So first let us set 

k = h{e)vi + f2{e)v2. 

Now so far as their action on functions of 9 is concerned, both Xi and X2 are just d/d9 
up to a constant factor. Thus 

T{k) = (wiXf + V2X^ + TiX^ + T2X^){k) 
R 



2c 



(^^1 - V2){fivi + f'2V2) + Ti/i + V2f2. 



It will be convenient to set 
R 



Ti = — (ui - V2){Qvi - PV2) 



R 
2c 



iyi - v2){Pvi - QV2), 



which means taking 



P = 



2cKP 



Q = 



2cKQ 



R{P^-Q^)' 



R{P^ - Q2) 

Then for A; to be a constant of motion we require that 

flVi + f2V2 = {Qvi - Pv2)h + (^'^1 - Qv2)f2, 

and this for all vi and V2- Thus /i and /2 must satisfy 







Q 


p 




" /i " 


J2. 




-P 


-Q . 











.f2 . 







' fl ' 







But then 

Q2 _ p2 

So if wc want f i and /2 to be linear functions of cos 9 and sin 9 we had better ensure that 
q2 _ p2 _ is, E?(^P'^ — Q'^) = Ac^K^, which in terms of the original parameters 

can be written 

^ ^ ^(mi?2 + 2 J2) {R^J + 2c^ 
moR^ 

So when the body of the carriage has the special position relative to the axle which is 
specified by this value of i, the system admits constants of motion of the required form. 
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Assume this condition is satisfied: can we find and A;4 such that 



2cK 

(sin 6)ks - (cos 9)ki = — :p^(vi - V2) = -Kivi - ^2)? 
Inspection of this formula suggests taking 

fes = —K sin 6{vi — V2) + H cos 9{vi + f 2) 

= (-k smO + H cos6)vi + (^sin6' + i7 cos6')t)2, 

ki = K cos 6{vi — V2) + H sin 6'(t;i + u 2) 

= (i^cos6' + iJsin6')ui + [KcosO - Hcos9)v2- 

In order that k^ should actually be a constant of motion its coefficients must satisfy 
f[ = Qfi + P/2; this holds, for P'^ - = 1, provided that 

H = -K{P-Q) = -^{P-Q). 

There are three other conditions to be satisfied if both k^ and /C4 are to be constants of 
motion; they all lead to this same formula for H by virtue of the fact that — = 1. 
In terms of the original parameters 

niR"^ + 2 J2 

^ = — 2ir- 

for what that's worth. The main point is that when i has the special value (in terms 
of the other parameters) given above, the nonholonomic and vakonomic problems are 
strongly consistent, via a section (pa = Pa + ka where the ka are constants of motion 
with k^ and A;4 as above; k^ can be chosen arbitrarily (subject to it being a constant of 
motion). In particular, in such well as in the case i = 0, the nonholonomic 

dynamics is the restriction to C of the Euler-Lagrange field of a Lagrangian L. 

Our solution disagrees with the solution in [10^, but a careful dimensional analysis easily 
shows that the solution in [10^ cannot be correct. 
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